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PROGRAMMABLE I/O STRUCTURE FOR FPGAs AND THE LIKE 
HAVING REDUCED PAD CAPACITANCE 



CROSS-REFERENCE TO RELATED APPLICATIONS 
The subject matter of this application is related to U.S. patent application no. 09/xxx,xxx filed on 
the same date as the present application as attorney docket no. 1054.026, the teachings of which are 
incorporated herein by reference. 

TECHNICAL FIELD 

The present invention relates to programmable devices, such as field-programmable gate arrays 
(FPGAs), and, in particular, to the input/output (I/O) interfaces for such devices. 

RACKOROUND 

A field-programmable gate array is a programmable device typically having a logic core 
surrounded by a ring of input/output (I/O) buffers. As sihcon technology migrates to smaller and smaller 
devices and as core logic densities increase, the ratio of core logic to I/O buffers increases. In general, 
however, the physical size of I/O buffers has not been shrinking as fast as the physical size of the core 
logic. One solution to this phenomenon has been to implement narrow, elongated I/O buffers around the 
periphery of newer devices. Unfortunately, this approach significantly increases the cost of the devices. 

Another factor contributing to the relatively large size of I/O buffers is the trend towards 
providing I/O circuitry with multiple different types of buffers connected to indrvidual pads, for example, 
order to support multiple different signaling applications. Unfortunately, this increased amount of 
try results in increased levels of capacitance as seen from the pads, which increased capacitance 
adversely affects the higher-performance signaling applications. 



in 

circui 



SUMMARY 

Problems in the prior art are addressed in accordance with the principles of the present invention 
by implementing programmable I/O buffers with circuitry that is shared by multiple pads. In that case, 
certain subsets of the circuitry that are not currently being used for one pad can be simultaneously used 
for another pad, thereby increasing the efficiency of use of the available I/O circuitry. Moreover, by 
implementing the selectivity capability using transmission gates connected between the outputs of output 
buffers and the corresponding pads, the capacitive load at a pad is reduced when a particular output 
buffer is not being used to drive an outgoing signal to the pad by opening the corresponding transmission 
gate. 
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In one embodiment, the present invention is a programmable device having programmable 
input/output (I/O) circuitry and programmable logic connected to receive incoming signals from and 
provide outgoing signals to the I/O circuitry. The programmable device comprises a first pad and a 
programmable I/O circuit (PIC) associated with the first pad. The PIC comprises a first output buffer and 
a first transmission gate. The first output buffer is adapted to present a first outgoing signal at the first 
pad. The first transmission gate is connected between the first pad and the first output buffer. The first 
transmission gate is adapted to be closed when the first output buffer is selected to present the first 
outgoing signal at the first pad. The first transmission gate is adapted to be open when the first output 
buffer is selected not to present the first outgoing signal at the first pad, wherein capacitive loading at the 
first pad due to the first output buffer is lower when the first transmission gate is open than when the first 
transmission gate is closed. 

RRTF.F DESCRIPTION OF T HF. DRAWINGS 

Other aspects, features, and advantages of the present invention will become more fully apparent 
from the following detailed description, the appended claims, and the accompanying drawings in which 
like reference numerals identify similar or identical elements. 

Fig. 1 shows a high-level block diagram of the layout of an exemplary FPGA of the present 

invention; 

Figs. 2-4 show schematic diagrams of programmable I/O buffers (PIBs) for the FPGA of Fig. 1, 
according to three different embodiments of the present invention; 

Fig. 5 shows a schematic diagram of output leg circuitry for the PB of Fig. 4, according to one 

embodiment of the present invention; 

Fig. 6 shows a schematic diagram of output leg circuitry that can be used to support a PIB having 
a current-mode logic (CML) output buffer for serial/deserial (SerDes) type applications; and 

Fig. 7 shows a schematic diagram of a transmission gate connected between the output of a push- 
pull buffer and a pad, such as may be used in any of the PIBs of Figs. 2-4. 

DETAILED DESCRIPTION 
Reference herein to "one embodiment" or "an embodiment" means that a particular feature, 
structure, or characteristic described in connection with the embodiment can be included in at least one 
embodiment of the invention. The appearances of the phrase "in one embodiment" in vanous places in 
the specification are not necessarily all referring to the same embodiment, nor are separate or alternative 
embodiments necessarily mutually exclusive of other embodiments. 
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FPGA Architecture 

Fig. 1 shows a high-level block diagram of the layout of an exemplary FPGA 100 of the present 
invention, having a logic core 102 surrounded by an input/output (I/O) ring 104. Logic core 102 includes 
an array of programmable logic blocks (PLBs) 106 intersected by rows of block memory 108. Each PLB 

5 contains circuitry that can be programmed to perform a variety of different functions. The memory 

blocks in each row are available to store data to be input to the PLBs and/or data generated by the PLBs. 
I/O ring 104 includes sets of I/O buffers 110 programmably connected to the logic core by 
multiplexor/demultiplexer (mux/demux) circuits 112. The I/O buffers support external interfacing to 
FPGA 100. Also located within the I/O ring are a number of phase-locked loop (PLL) circuits 114 that 

10 are capable of providing different timing signals for use by the various elements within FPGA 100. 
Those skilled in the art will understand that FPGAs, such as FPGA 100, will typically include other 
elements, such as configuration memory, that are not shown in the high-level block diagram of Fig. 1 . In 
addition, general routing resources, including clocks, buses, general-purpose routing, high-speed routing, 
etc. (also not shown in Fig. 1), are provided throughout the FPGA layout to programmably interconnect 

1 5 the various elements within FPGA 100. 

The layout of an FPGA, such as FPGA 100 of Fig. 1, comprises multiple instances of a limited 
number of different types of blocks of circuitry. For example, an I/O ring may contain a number of 
instances of the same basic block of circuitry repeated around the periphery of the device. In the 
example of FPGA 100, I/O ring 104 is made up of multiple instances of the same basic programmable 

20 I/O circuit (PIC), where each PIC provides a particular number (e.g., three) of the I/O buffers of the I/O 



ring. 



Exem plary Programmable I/O Buffers 

Fig. 2 shows a schematic diagram of a programmable I/O buffer (PIB) 200 for FPGA 100 of Fig. 

25 1, according to one embodiment of the present invention. As shown in Fig. 2, PIB 200 provides I/O 
circuitry associated with four I/O silicon pads A-D. In particular, PIB 200 has six push-pull output 
buffers 202, four input receivers 204-210, and a differential output buffer 212, variously connected 
between pads A-D and other internal FPGA circuitry (not shown). 

More particularly, input receiver 204 is connected to receive incoming signals from pad A, input 

30 receiver 206 is connected to receive incoming signals from pad B, input receiver 208 is connected to 
receive incoming signals from pad C, and input receiver 210 is connected to receive incoming signals 
from pad D. In addition, the differential outputs from differential buffer 212 are connected to provide a 
differential outgoing signal to pads A and C. Furthermore, each of output buffers 202a-c is connected to 
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provide an outgoing signal to pads A and B via a corresponding pair of transmission gates 214 and 216, 
respectively. Similarly, each output buffer 202d-f is connected to provide an outgoing signal to pads C 
and D via a corresponding pair of transmission gates 214 and 216, respectively. In a preferred 
implementation, each transmission gate 214, 216 is independently controllable to be either on (closed) or 
5 off (open). 

The circuitry of PIB 200 of Fig. 2 enables the four pads A-D to support a wide range of different 
operational modes. In theory, different possible modes of operation can be hnplemented from different 
combinations of the following capabilities for the four different pads A-D: 
Pad A 

1 o o Incoming signal drives input receiver 204; 

o Differential buffer 212 drives half of differential outgoing signal; and/or 

o Any combination of one or more output buffers 202a-c drives "combined" outgoing signal. 

PadB 

o Incoming signal drives input receiver 206; and/or 
15 o Any combination of one or more output buffers 202a-c drives "combined" outgoing signal. 

PadC 

o Incoming signal drives input receiver 208; 

o Differential buffer 212 drives other half of differential outgoing signal; and/or 

o Any combination of one or more output buffers 202d-f drives "combined" outgoing signal. 

20 PadP 

o Incoming signal drives input receiver 210; and/or 

o Any combination of one or more output buffers 202d-f drives "combined" outgoing signal. 
One of the advantages of the design of PIB 200 is the efficient use of circuitry. For example, 
although pad A can be simultaneously driven by all three output buffers 202a-c, if only a subset of those 

, . - « a jn mti Vip simultaneously used 

25 output buffers is used to drive pad A, the one or two unuseu ua^ut^l— - 

to drive pad B, by appropriately setting the states of the corresponding transmission gates 214 and 216. 
Moreover, if one or more of output buffers 202a-c are used to drive pad B, pad A can be simultaneously 
operated in a differential output mode via differential output buffer 212 or in an input mode via input 
receiver 204. Thus, while circuitry is provided to enable pads A and C to be operated in a number of 
30 different input and output modes, certain subsets of that circuitry that are not currently used for pads A 
and C can be simultaneously used for pads B and D, thereby increasing the efficiency of use of the 
available circuitry. 
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Fig. 3 shows a schematic diagram of a PB 300 for FPGA 100 of Fig. 1, according to another 
embodiment of the present invention. PB 300 is similar to PB 200 of Fig. 2, except that, instead of 
having a unique input receiver for each pad, PB 300 has shared input receivers. In particular, mput 
receiver 304 is connected to receive an incoming signal from either pad A or pad B. Similarly, input 
receiver 308 is connected to receive an incoming Signal from either pad C or pad D. The selection of 
which pad drives the input receiver is provided by muxes 318. PB 300 of Fig. 3 may be able to be 
implemented in a smaller area than PB 200 of Fig. 2, albeit with reduced functional capabilities. 

Fig 4 shows a schematic diagram of a PB 400 for FPGA 100 of Fig. 1, according to yet another 
embodiment of the present invention. PB 400 is similar to PB 300 of Fig. 3, except that, unlike 
differential output buffer 312 of Fig. 3, which drives output pads A and C with the two legs of a 
differential output signal, positive and negative outputs 420 and 422 of differential output buffer 412 
apply the two legs of a differential output signal to output leg circuitry 424. Circuitry 424 enables the 
two legs of the differential output signal to be selectively applied either to pads A and C, respectively, or 

to pads B and D, respectively. 

In one implementation, the selection of either pad pair A/C or pad pair B/D is mutually exclusive, 
such that PB 400 supports three different operating modes for differential output buffer 412: a first 
mode in which pad pair A/C is driven, a second mode in which pad pair B/D is driven, and a third mode 
in whichneither pad pair is driven(e.g., when buffer412 is turned off). Note that this mutual exclusivity 
is not necessary for all implementations. Such "non-exclusive" implementations may support a fourth 
operating mode in which both pad pairs are driven at the same time with a single differential output 
signal from buffer 412. Furthermore, in alternative embodiments, the PB could have two differential 
output buffers, where each buffer drives a different pad pair. 

The "mutually exclusive" implementation can be used to enable the FPGA design to support both 
flip-chip and wire-bond packaging configurations in an efficient manner. Prior-art FPGA designs that 

. . c. diffarotial output buffers in which 

support both flip-chip and wire-bond pacicaging cum.su, a™, . ~ a - i 

each differential output leg has a single metallic structure that can support either a flip-chip bond or a 
wire bond If the FPGA is packaged in a flip-chip configuration, then that portion of the metallic output 
leg structure strictly corresponding to the wire bond would essentially be unused. Similarly, if the FPGA 
is packaged in a wire-bond configuration, then that portion of the metallic output leg structure strictly 
) corresponding to the flip-chip bond would essentially be unused. Although this prior-art solution 

supports both packaging configurations, the excess (i.e., unused) portions of the metallic output leg 

structures add undesirable capacitance at the output pads. 
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The principles of PB 400 can be employed to support either flip-chip or w,re-bond packagtng 
configurations by using outpu. leg circuitry 424 to drive, in a mutually exclusive manner, either pad patr 
A/C (e g associated with a fltp-chip configuration) or pad pair B/D (e.g., associated with a wtre-bond 
oonfigura'tion), where each metal„c stntcture connecting one of the two differentia, buffer outputs wttb a 
particular pad is destgned to minimize capactive loading at the pad. In particular, when outpu, leg 
circuitry 424 is configured to drive one of the pad pairs, capacttance from the other pad pair is shtelded 

from the selected pad pair, and vice versa. 

Fig 5 shows a schematic diagram of output leg circuttry 424 accordtng to one embodiment of the 

present invention. As shown in Fig. 5, output leg cfrcuitry 424 has a positive current source 502 and a 

negative cwren, source 504 separated by a configuration of tranststors 506, 508 and switches 5.0, 512. 

For example, switch 5.0a is connected to the gate of tiansistor 506a, and switch 5.2a is connected to the 
ga ,e of switch 508a, while switch 510b is connected to (he gate of translator 506b, and swttch 512b ts 
connected to the gate of switch 508b. And analogously for transistors 506cd, 508c-d and switches 
510c-d 512c-d The node connecting tranststors 506a and 508a ts also connected to pad A, whtle the 
node connecting transistors 506b and 508b ts connected to pad B. Similarly, the node connecting 
transistors 506c and 508c is connected to pad C, while the node connecting tranststors 506d and S08d ts 
connected to pad D. 

With switches 510a, 512a, 510c, and 512c closed and switches 510b, 512b, 510d, and 512d 
open, signals corresponding to the two dtfferentia. .egs 420, 422 of a differential output signal from 
drfferential output buffer 412 of Fig. 4 will he presented at pads A and C, respectively. Simtlariy, wtth 
swttches 510., 512a, 510c, and 512c open and swttches 5.0b, 5.2b, SIM. and 5.2d closed, stgnals 
corresponding to the two differential legs 420, 422 will he ptesented at pads B and D, respectively. 

In a prefen-ed implementation, output leg circuitry 424 also tncludes a second (optional) posttive 
cutren, source 514 configured in parallel wtth posttive current source 502. Current source 514 ,s 

1 „. ~ ^nitacre PMV and an 

controlled by op amp 516 which is differentially onven by common- „- 

appropriate reference voltage Vref. Mux 518 selects the appropriate common-mode voltage to apply to 
op amp 516. In particular, when switches 510, 512 are configured to drive pads A/C, mux 5.8 ts 
configured to select .he common-mode vo.tage between pad A and pad C. Al.ema.ivdy, when swttches 
5,0, 512 are eonftgured to drive pads B/D, mux 518 ts configure* «o select the common-mode voltage 

3 between pad B and pad D. 

Operationally, current source 514 compensates for vanations in the common-mode voltage by 
dnving the output leg creamy to keep the selected common-mode voltage equal to me reference voltage 
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Vref In alternative unptanentauons, an anatogous additional current souree eotdd be added in paral.e! 
witt, negative current aource 504, either in addition to or instead of current source 514. 

Although no. shown in the figures, those skilled in the art will understand that pad A and pad 
are separated hy a series combutation of two substantially equa, resistors R, where the —tuode 
vohage between pads A and C is the vohage a, .he node connecting .he two res,s,ors. Warty pad B 
and pad D are separated by a series combination of two subst»«a„y equal resistors R, where the 

The value of each resistor R is preferab.y substantially larger (e.g., five times) than ttte ex.en.al 
termination applied between each pad pair. 

Note that in PB 400, output leg circuitiy 424 enables two different patrs of pads to be 
actively driven with a differential output signal geuerated by an otherwise shared outputbuffer ht 
p^ar.nontauerwmchoutpurpadpair is se.ected, the same reference ctrcuitiy (which genets 
Lference voices Vref, Pref, and Nref, and pre-ttrive circuitiy (represented elective,, m Ftg. 4 by 
buffer 4.2) are used to generate the signals on lines 420 and 422, wuhout any perfonnance tmpac Note, 
further, that relative., high capacitance a, node 520 improves perfonnanee hy absorbing jitter m ttte 
current, tttereby compensating for instability in current source 504. 

Although FIB 400 and output leg ctrcuttry 424 support wo different patrs of pads, the pnncples 
of this circuitiy can be extended to any number of pad pahs by providing addttional sets of drrve leg 
circuitry similar to that shown in Fig. 5. 
, P B 400 has differentia, output buffer 412, which may he suitable for low-vohage d.fferentia, 

' signaling (LVDS) appltcattons. The present invention can a,so be implemented in me context of a PB 

buffer where each of the output buffers is able to drive different pairs of pads. 

Fig . 6 shows a schematic diagram of output leg ctrcutey 600 that can be used to sup,**, a 

. „ r ;.i/j rt .^oi ^prDp^ tvne aDolications. In one 

!5 current-mode logic (CML) output ouner ior seiiauucovrw v . . 

embodiment, output legetrcuitty 600 replaces ou,pu, ,e g creuitry 424 of P., when . P* h, 
CML outputbuffer in p,ace of differentia, output buffer 412. In another embodtment, me PB has 
CMlou*u,buffer 1 nadd,tion,odifferen«a,outipu,buffe,4,2,whereoutipu,,ege 1 romtiy60.s 

from pre-drive circuitry of the CML output buffer. 
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As with output leg circuitry 424 of Fig. 4, the principles of output leg ctreuitry 600 cau be 
expanded to support one or more additional pad pair, by addtng on. or nrore addtttonal seta of output leg 
cranny Moreover, like output leg eircuiuy 424, the principles of output leg circuitry 600 can be used 
,„ provide FPGA des.gns <ha, efficiently support both flip-chtp and wtre-bond packaging configurattons. 
The pnnciples of output leg circuits 424 and 600 can be extended to any other suttable type of output 



buffer. 



15 



Transmission Gates 

In addition to supporting the selective driving of different pads, the transrmssion gates prov.de 

10 anotherbenefnofred^^^ 

202aisnotusedtodriveeimerof P adsAo^ 

open state, reduces the capacitive load that appears at pads A and B that would otherw.se be confuted 
by the circuitry of output buffer 202a. This reduction in capacitive load may be significant, especutily 
when the I/O buffers are configured for high-speed signaling applications. 

Fig 7 shows a schematic diagram of a transmission gate 702 connected between the output of a 
push-pullbuffer704andapadP, S uchasmaybeused 1 nanyofthePmsofF 1 g,2-4.In P articular, 

push-pull buffer 704 comprises /Vpush-pull "sub-buffers" connected in parallel, each sub-buffer , 
C omprising(l)aresistorRPiconnectedbetweenanegativelydnven transistor MP. and a shared node S 
of buffer 704 and (2) a resistor RNi connected between a positively driven transistor MN. and shared 

20 nodeS. ■ * a 

The resistors improve to linearity of to output impedance of buffer 704 as a funcfton of pad 
voltage and a.so provtdc impedance matching with to board impedance. Transmisston gate 702 tsola.es 
.he parastttc capacitance of to resistors from pad P. Without —ion gate 702, to pad capacitance 
would increase significantly due to to parastnc capacttanec of to resistors, which would degrade to 

212 of Fie. 2. 

25 performance of high-speed buffers, sunn aa dme. v.,.™ ™tpu 

' Transmtssion gate 702 comprises three switch devices TXi-3 connected in parafiel between 
push-pull buffer 704 and pad P. fnparftcu.ar, one side of to channe. of each switch device is connected 
» node S and to otor channe, side is connected to pad P. Turning off each switch device wil, p,ace 
tmnsmtssion gate 702 in an open (i.e., off) state, ,» which a signal from transmission gate 702 ,s no. 
30 presen.cda.padP. Ontoofherha»d,n,mingonanyoncormor=oftoswi«chdevtceswinp,ace 

transmission gate 702 in a closed (i.e., on) state in which a signal from transmission gate 702 . presented 
at pad P. 
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Although switch devices TX1-3 isolate parasitic capacitance of the resistors from pad P, their 
sizes are preferably optimized to meet the relevant I/O specification of push-pull output buffer 704 across 
all process, voltage, and temperature corners. At the same time, their junction capacitances are 
preferably small. To meet these goals, the size of the transmission gates are optimized such that their 
junction capacitances are small and the resistances are about 20-30% of the resistance of the n-well or 
poly resistors. In addition, three different control signals (and their complements) are used to enable one 
or more of transmission gates TX1-3 to compensate for variation of their own resistance due to process 
and temperature variations. Note that, while process variations are static, temperature variations may be 
dynamic, in which case the selection of the control signals for the different transmission gates can 
preferably be changed as needed as temperature changes. 

In one possible implementation, if transmission gate 702 is not implemented, the total area of the 
resistors in push-pull buffer 704 is about (63 microns x 63 microns) or about 4000 microns 2 . In current 
deep sub-micron technology, the area junction capacitance C } is about lxlO" 3 farads/meter 2 and side-wall 
capacitance C sw is about 1.2x10 '° farads/meter. Thus, the total capacitance C RES seen by pad P will be: 

Cres = (4000//m A 2 x le- 3f /m A 2) + (4x 63/mix 1.2e-10f/m) 
= 4 pf + 0.03 pf 
= 4.03 pf 



When transmission gate 702 is implemented to provide isolation between single-ended buffer 704 
and a differential output buffer, such as differential output buffer 212 of Fig. 2, pad P will be exposed to 
the junction capacitance of transmission gate 702 rather than the capacitance of buffer 704. 
Transmission gate 702 can be sized to contribute about 20-30% or more of the resistance of n-well or 
poly resistors and still maintain sufficient linearity of the resistance. In one implementation, the total 
transmission gate size is about 5 x (120+60) or 900 microns. The total capacitance C TGATE seen by pad P 
will then be: 

Ctgate = (90tyim x 1 .95 /mi x le-3tfm A 2) + (2 x (1 .95//m+900//m) x 1 .2e-10f/m) 

1.76 pf+ 0.22 pf 
1.98 pf 
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15 



where ,55 — i S »* ~* *— * Msis,ws - ™ s :r°" ds ,o about a 

5 , % reduction in capacitance seen a. pad P by incorporating transmisston gate 702. 

numberofoneormoreawttchdevtces. and Switch device" are not necessary 

Note that, as used in this specification, the terms swncn 

e q u„a,e„,. For exantp.e, —ion gare ,02 is an examp.e of a type of swirch that ,s ..self 

implemented using three different switch devices. 

AUhoughmePBsofthepresentinvenhonhavebeendesonbedinutecontextof 

output buffers could alternatively be used. ^.whuflers 
Although, for example, PIB 200 of Fig. 2 is implemented with two sets of three output buffer 
^JllFBsacrdmgrothepreaen, — may be ^>en»«d w,m any stnUb, nnmber of 
202, in general, c im n ar i v although PB 200 is implemented as being associated 

one or more output buffers m each set. Similarly, although 

. —1 PTRs according to the present invention may be implemented wun y 

w «h four pads, (h inventta ^ ta imptanent ed without a dtfferentia. 

number o pad, F— PB P ^ ^ ^ ^ ^ rf fc ^ _ ,„ 

-~==="=^ 
=—=====«=== 

Uimplementedbenveenthatoutputhufferandrhatpad. n f FpGAs those skilled in the 

Although the present inventton has been described tn the context of FPOAs, those sht 

, progmmmable device, such as, without limitation, programmable logic devtees (PLDs, 
Tt will be further understood that vanous changes in the details, maxer 



30 

following claims 
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